Perceptual Speech Enhancement Using a Hilbert Transform Based Time-Frequency Representation of Speech
نویسندگان
چکیده
A new Time-Frequency (TF) representation of speech signal is introduced and used for speech enhancement. TF representation and speech enhancement algorithm are both based on perceptual properties of human auditory system in which the concept of band analysis is exploited. TF representation is carried out by the means of analytic decomposition of speech signal in the hearing Critical Bands (CB) where the envelope and phase components of the analytic signals are used. For the purpose of enhancement a time varying gain function is used which takes into account the threshold of hearing. This threshold is calculated on the basis of masking effects using a perception model. Signal is reconstructed from the modified envelopes and the phases of noisy signal decomposed in critical bands. Experiments show that using the threshold of hearing in which temporal masking is included can effectively eliminate the musical noise without a significant decrease in intelligibility. Results using noise estimation by Voice Activity Detector (VAD) and Speech Presence Probability (SPP) are reported on.
منابع مشابه
A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain
Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...
متن کاملSpeech Enhancement using Adaptive Data-Based Dictionary Learning
In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...
متن کاملSpeech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملSpeech Enhancement Using Hilbert Spectrum and Wavelet Packet Based Soft-Thresholding
A method of and a system for speech enhancement consists of Hilbert spectrum and wavelet packet analysis is studied. We implement ISA to separate speech and interfering signals from single mixture and wavelet packet based softthresholding algorithm to enhance the quality of target speech. The mixed signal is projected onto time-frequency (TF) space using empirical mode decomposition (EMD) based...
متن کاملA Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...
متن کامل